Turner et al. BMC Evolutionary Biology 201 3, 13:269 
http://www.bionnedcentral.conn/1471 -21 48/1 3/269 



Evolutionary Biology 



RESEARCH ARTICLE Open Access 



Analyses of amplified fragment length 
polymorphisms (AFLP) indicate rapid radiation of 
Diospyros species (Ebenaceae) endemic to New 
Caledonia 

Barbara Turner^'', Ovidiu Paun\ Jerome Munzinger^, Sutee Duangjai^ Mark W Chase^'^ and Rosabelle Samuel^ 



Abstract 

Background: Radiation in some plant groups lias occurred on islands and due to the characteristic rapid pace of 
phenotypic evolution, standard molecular markers often provide insufficient variation for phylogenetic 
reconstruction. To resolve relationships within a clade of 21 closely related New Caledonian Diospyros species 
and evaluate species boundaries we analysed genome-wide DNA variation via amplified fragment length 
polymorphisms (AFLP). 

Results: A neighbour-joining (NJ) dendrogram based on Dice distances shows all species except D. minimifolia, 
D. parviflora and D. vieillardii to form unique clusters of genetically similar accessions. However, there was little 
variation between these species clusters, resulting in unresolved species relationships and a star-like general NJ 
topology. Correspondingly, analyses of molecular variance showed more variation within species than between 
them. A Bayesian analysis with BEAST produced a similar result. Another Bayesian method, this time a clustering 
method, STRUCTURE, demonstrated the presence of two groups, highly congruent with those observed in a 
principal coordinate analysis (PCO). Molecular divergence between the two groups is low and does not 
correspond to any hypothesised taxonomic, ecological or geographical patterns. 

Conclusions: We hypothesise that such a pattern could have been produced by rapid and complex evolution 
involving a widespread progenitor for which an initial split into two groups was followed by subsequent 
fragmentation into many diverging populations, which was followed by range expansion of then divergent 
entities. Overall, this process resulted in an opportunistic pattern of phenotypic diversification. The time since 
divergence was probably insufficient for some species to become genetically well-differentiated, resulting in 
progenitor/derivative relationships being exhibited in a few cases. In other cases, our analyses may have 
revealed evidence for the existence of cryptic species, for which more study of morphology and ecology are 
now required. 
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Background 

Island floras are often characterized by high levels of en- 
demism and groups of closely related but morphological 
and ecological divergent species that are mostly the result 
of single colonisation events followed by radiation e.g. 
[1,2]. New Caledonia was cited as one of the 34 biodiver- 
sity hotspots recognized by Conservation International 
[3,4]. Nearly 75% of the native flora is endemic [5], which 
is the fourth highest for an island [6]. While the continen- 
tal part of New Caledonia (mainly Grande Terre) was en- 
tirely submerged during the Eocene (until 37 mya), a thick 
layer of heavy-metal-rich oceanic mantle accumulated [7]. 
Today, around one-third of the main island, Grande Terre, 
is still overlaid with ultramafic substrates. Generally, 
Grande Terre is a substrate mosaic [8], which is cited as 
one reason for the high level of endemism found there e.g. 
[9]. The climate in New Caledonia ranges from tropical to 
subtropical, and the main island is split by a mountain 
range into a humid eastern and a dry western part with 
prevailing winds and rain coming from the south-east. 
Taking climatic and geological factors together, Grande 
Terre has a wide range of environmental diversity. The 
main vegetation types in New Caledonia are evergreen 
humid forests, maquis, dry forests, littoral vegetation, and 
savannah [10]. 

One plant group that has taken advantage of many avail- 
able habitats on New Caledonia is Diospyros, which is the 
largest genus (> 500 species in its broad circumscription 
[11]) of Ebenaceae, a pantropical family of woody plants. 
In New Caledonia Diospyros species range from sea level 
up to ca. 1250 m (the highest point New Caledonia is 
1628 m), and species are found in all vegetation types ex- 
cept mangroves, with several species co-occurring in 
micro-sympatry (Table 1). 

Diospyros colonised New Caledonia via long-distance 
dispersal at least four times [12]. In previous studies based 
on low-copy nuclear and/or multiple plastid markers 
[12,13], it was possible to resolve phylogenetic relationships 
for the majority of Diospyros species, except for one 



group of endemics from New Caledonia. Of the 31 New 
Caledonian Diospyros species, 24 belong to this clade of 
closely related endemics. In previous analyses, this 
strongly supported group is related to species found on 
islands throughout the Indian and Pacific Oceans as far 
east as Hawai'i [12,13]. However, due to extremely low 
levels of sequence divergence, it was not possible to 
tease apart relationships between these species (they 
formed a hard polytomy in most individual trees, and 
there was little informative variation that permitted 
clustering of pairs or groups of species). Most of these 
closely related species are morphologically and eco- 
logically clearly differentiated (for examples see [13]), 
and several species are narrow endemics restricted to 
small areas. 

Amplified fragment length polymorphism (AFLP; [14]) is 
a fingerprinting technique that has proven to be usefial for 
revealing phylogenetic relationship among closely related 
taxa (e.g. Hypochaeris, [15]; Lactuca, [16]; Phylica, [17]; 
Trollius, [18]; Ranunculus alpestris, [19]; Puya, [20,21]; 
Araucaria, [22]). In contrast to standard phylogenetic 
markers, AFLP variation is spread across the whole gen- 
ome, spanning both coding and non-coding DNA regions 
and may therefore be more representative of overall genetic 
patterns present as well as being highly informative for 
phylogenetic analyses at the low phylogenetic level [23,24]. 
Compared to other fingerprinting techniques AFLP shows 
increased reproducibility and does not require any prior 
knowledge of the analysed genomes. However, there are 
some detrimental issues to consider when working with 
AFLP data; these include potential non-homology and non- 
independence of fragments, asymmetry in the probability 
of loss/gain of fragments, and problems in distinguishing 
heterozygote from homozygote bands e.g. [23,25]. Despite 
these difficulties, several authors have used AFLPs to reveal 
phylogenetic relationships corroborated by analyses of 
other types of data, especially for species that have diverged 
recently or radiated within a short period of time e.g. 
[15,17,23,26]. 



Table 1 Occurrence of Diospyros species in different habitats in New Caledonia 



Substrate 



Limestone Serpentine 



Schist 



Ultramafic rock 



Volcanic rock 



Vegetation 



Humid mountain 
forest 

Humid low 
land forest 

Mesophyll forest 



Maquis 
Littoral forest 



D. minimifolio, 
D. pustuolo, 
D. tridentata 



D. calciphilo, 
D. inexplorata 



D. revolutissima 



D. porvifloro, 
D. trisuico 



D. glons, D. poncheri, 
D. porvifloro, D. umbroso 

D. erudito 



D. vieillordii 



D. flovocorpo, D. lobillordierei 
D. umbroso 

D. cherrieri, D. erudito, D. minimifolio, 
D. perplexo, D. pustuloto 



D. impolito 



Habitats are grouped according to vegetation type and substrate. Note that several species are co-occurring and that a few species are found In several habitats. 
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In this study we focus on this group of closely related 
species of Diospyros endemic to New Caledonia (Figure 1). 
Our aim was to clarify species boundaries as well as phylo- 
genetic relationships between these New Caledonian 
Diospyros species. Integrated in a broader context, the 
outcome of our research should help us better under- 
stand the factors behind and mechanisms of speciation 
and radiation on islands. 

Results 

After excluding 186 replicates the final matrix used for ana- 
lyses contained 192 individuals and 792 fragments. The 
AFLP profiles showed good reproducibility with a mean 
error-rate of 2.4% across all replicated samples. Because the 
focus of this study was on the phylogenetic relationships 
between species and species limits rather than intra- 
specific population genetics, we are presenting and dis- 
cussing mostly the results of inter-specific relationships. 
We are presenting here only unrooted trees due to the 
low resolution of their backbone. We analysed the data 
using neighbour-joining (NJ) dendrograms and princi- 
pal coordinate analysis (PCO) with different distance 
methods, and in both cases the Dice distance gave the 
highest resolution of relationships between species. 

The NJ analysis resulted in a star-like dendrogram with a 
backbone of short branches lacking bootstrap support 
greater than 75%. All species except D, minimifolia, D, par- 
vifiom and D. vieillardii form single clusters in the NJ tree 
(Figure 2A). However, only eight {D, calciphila, D, cherrieri, 
D, inexploratUy D, impolita, D, pustulata, D, trisulca, 
D, umbrosa and D, yahouensis) of the 21 included species 
form clusters with bootstrap higher than 80%. The Bayesian 
inference (BI) produces a similar result. All species except 
D, labillardiereU D, minimifolia, D, pancheri and D, parvi- 
flora form single clusters in the BI tree (Figure 2B). Apart 
from D, flavocarpa, D, revolutissima, D, tridentata and 
D, vieillardiU all clustered species have high (> 0.95) poster- 
ior probabilities. 

PCO separated accessions into two main groups (here- 
after named "white" and "grey") that can be subdivided into 
six subgroups (Figure 3). Within the "white" group (defined 
in the STRUCTURE results below) subgroup one includes 
D, vieillardii (individuals indicated by squares in Figure 3), 
subgroup two D, calciphila (triangles) and subgroup three 
the rest of the individuals from this group (circles). In the 
"grey" group (more extensively described in the STRUC- 
TURE results below) subgroup four included D, flavo- 
carpa, D, umbrosa and D, vieillardii (indicated by squares 
in Figure 3), subgroup five D, erudita and D, glans (trian- 
gles) and subgroup six the remaining individuals (circles). 
A PCO of populations (not shown) based on the pair-wise 
FsT distances obtained from the AMOVA resulted in simi- 
lar groups and subgroups of populations as those obtained 
from the individual-based PCO. STRUCTURE analysis 



gave the highest value of AK for K = 2 plus few other sub- 
optimal K values (Figure 4A and B). However the latter 
contained clusters with negligible membership ("empty" 
clusters). Both K = 3 and K = 6 resulted in three visible 
clusters, with one cluster being only found in significantly 
admixed samples (Additional file 1). Visualisation of K =16 
and K = 21 showed two clusters only and both analyses are 
highly similar to each other (Additional file 1). It has been 
argued the ad-hoc Evanno method [27] favours by default 
K =2 over K = 1 when searching for the correct number of 
clusters [28]. However, PCO separated individuals included 
in our analyses into two groups as well, and therefore we 
consider K = 2 as representative for our sample set. For 
K = 2, the allele-frequency divergence between the two 
groups was 0.0074. One group ("grey") includes the major- 
ity of accessions (Figure 4C). The other group ("white") in- 
cludes D, calciphila, D, labillardierei (population 13 and 
accession BT179), D, minimifolia (majority of individuals), 
D, pustulata, D, sp. Pic N'ga, D, tridentata (accessions 
BT206 and BT207), D. veillonii (accession BT224) and 
D, vieillardii (population 37 [except accession BT017], 
population 39 [except accession BTIOO] and population 
41). Seven individuals appear to be admixed (less than 
90% identity with one of the groups); most of those are 
D, vieillardii. Several species {D, labillardierei, D, minimi- 
folia, D, tridentata, D, veillonii and D, vieillardii) and even 
some populations comprise individuals belonging to each 
of the two groups. 

In order to quantify the amount of genetic variation be- 
tween species we have performed a non-hierarchical 
AMOVA with species assigned as "populations". This ana- 
lysis showed as little as 30% of the variation to occur 
among the species. However, in the STRUCTURE, PCO, 
NJ and BI analyses several species seemed to be formed by 
genetically distinct populations assigned to different clus- 
ters and coming in distinct positions in the tree. To avoid 
mixing up of cryptic variation within a group, we run 
further AMOVAs with populations assigned as sample lo- 
calities, despite the relatively low sample size per locality. 
Results of non-hierarchical AMOVA in this case indicate 
a higher level of differentiation between populations, 
resulting in an Fgr of 0.38. There was no visible difference 
in gene diversity between stands of co-occurring species 
and isolated populations. Several hierarchical AMOVAs 
(except one based on the STRUCTURE results) were not 
significantly more informative than the non-hierarchical 
AMOVAs (Table 2). Grouping populations according to 
geography or ecology, explains a surprisingly low amount 
of the variation (1.4 - 1.6%). Furthermore, allocating pop- 
ulations to the 21 included species assigns a relatively high 
percentage of variation at the between-species level 
(19.4%), but with a highly similar Fst value to the non- 
hierarchical AMOVA results. When higher-level group- 
ings paralleled the STRUCTURE results, we obtained the 
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Figure 1 (See legend on next page.) 
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f ^ 

(See figure on previous page.) 

Figure 1 Bayesian maximum clade credibility tree of New Caledonian Diospyros species based on plastid and nuclear DNA data 
(taken from Turner et al. [13]). Bold brandies are supported (> 70% bootstrap and Bayesian posterior probability). Accessions in blue 
correspond to the white group found in STRUCTURE and PCO, green ones to the grey group (light blue/green accessions included in current 
data set, dark blue/green accessions failed in current analysis but colour indicates the group to which they most probably belong), accessions in 
black are not included in the present study. 



highest FsT value (0.4), albeit the percentage of variation 
between the two clusters as defined by STRUCTURE was 
only 9.5%, lower than the percentage of differentiation 
shown between species. Removing seven admixed samples 
(less than 90% membership form each of the two groups 
based on the STRUCTURE results) from the AMOVA 
gave nearly the same results as the analysis including them 
(Table 2). 

The average gene diversity over loci within populations 
ranged from 0.03 in D. erudita (population 4) to 0.12 in 
D. parviflora (population 22). Contrary to predictions, the 



highest number of polymorphic sites, pair-wise differences 
and average gene diversity were not found in the admixed 
populations (according to the STRUCTURE results) but 
in D. parviflora (for details see Additional file 2). 

Discussion 

"Explosive" radiations featuring rapid opportunistic mor- 
phological and ecological diversification are phenomena 
previously reported for some islands (e.g. [29] and refer- 
ences therein). Extreme ancestral bottlenecks, together with 




Figure 2 Phylogenetic dendrograms inferred from the data collected in this study. Each species is shown in a different colour. Colours 
were selected randomly and do not indicate any grouping. A: Neighbour joining dendrogram based on Dice distances. Black dots indicate nodes 
with > 80% bootstrap support. B: Bayesian maximum clade credibility dendrogram. Black dots indicate nodes with > 0.95 Bayesian posterior 
probability. Picture credits: D. colciphilo: H. Benoit, www.endemia.nc; D. cherrieri: C. Chambrey; D. erudito, D. poncheri, D. pustuloto, D. umbroso, 
D. vieillordii: D. & I. Letocart, www.endemia.nc; D. flovocorpo, D. minimifolio, D. revolutissimo, D. sp. Pic N'ga: J. Munzinger; D. glons, D. porvifloro: 
J.-L. Ruiz, www.endemia.nc; D. impolita: J. Barrault, www.endemia.nc; D. labillardiereh B. Turner; D. perplexa, D. yahouensis: V. Hequet; D. veillonii: 
R. Amice, www.endemia.nc. 
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Figure 3 PCO of individual accessions based on Dice distances. Shading of the base-grid of the figure marl<s the two main groups inferred by 
STRUCTURE analysis - white and grey. Each species is shown in a different colour. Colours were selected randomly and do not indicate any grouping. 



on-going hybridization and incomplete lineage sorting, 
can prevent phylogenetic reconstruction in cases of island 
radiations if they have been recent and produced many 
species [30]. However, a good understanding of phylo- 
genetic relationships within radiating groups is key 
for further evolutionary studies into mechanisms and 
whether change is adaptive, due to drift in small popu- 
lations or other phenomena [29]. 

For the endemic New Caledonian Diospyros species, 
previous studies, based on multiple plastid [12] and low- 
copy nuclear [13] markers, showed 21 species to be closely 
related (Figure 1) and were not able to clearly resolve 
phylogenetic relationships among them. In the combined 
data set (plastid and nuclear markers; [13]) only seven of 
the 21 species included were found to form highly sup- 
ported groups of accessions from single species. Individ- 
uals belonging to each of the remaining 14 species failed 
to cluster according to their taxonomic circumscription. 
Dating analysis based on plastid and low-copy nuclear 
markers showed that the common ancestor of this clade 
of endemic New Caledonian Diospyros species has arrived 
in New Caledonia around nine million years ago [13]. 
Diospyros vieillardii has been shown to be sister to the rest 



of this endemic clade and separated from the rest of the 
species around 7.2 million years ago. 

Results of the current study using genome-wide AFLP 
markers reveal that most species form unique groups par- 
alleling recognised species. Around one-third (eight spe- 
cies, NJ dendrogram. Figure 2 A) and one-half (11 species, 
Bayesian tree. Figure 2B) of the species, are genetically 
distinct with high support (Figure 2). However, the overall 
AFLP results prove unable to clearly resolve the backbone 
of trees, similar to previous results obtained from analyses 
of DNA sequence data [13]. Intra-specific variation was 
greater (-80%) than that found at inter- specific level 
(-20%). This low ratio of among- versus within-species di- 
vergence in the context of considerable morphological 
and ecological divergence is indicative of a recent diversifi- 
cation [22]. Such a process can explain why we were able 
to get clear species boundaries for most species but 
were unable to clearly resolve phylogenetic relationships 
between them. 

Two species that did not form well-defined clades 
{D, minimifolia and D, parviflora) were previously consid- 
ered by White [31] to show variability in leaf morphology 
that may indicate that they are in fact a collection of 
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Figure 4 Results of the STRUCTURE analysis. A) Delta K values of the K values tested. B) Mean Ln likelihood of K values tested. C) Clustering 
of K= 2. The two groups are marked in different shades (white and grey). 



several species. For D. minimifolia White [31] mentioned 
that the type population (close to population 15 of this 
study) has smaller leaves compared to other populations 
of this species. In our results this population clusters to- 
gether with the majority of the D, minimifolia accessions; 
the population that is separated from the rest (population 
16) is from Gaji. According to White [31] D, parviflom is 
a wide-spread species, showing considerable variability of 
leaf morphology even within populations, making it im- 
possible to differentiate these into different species. Our 
results show all accessions of D. parvifiora, except those 
from Plateau de Tango (population 24), to form a group. 
All included accessions from D. parviflom are from ultra- 
mafic localities. 

To our surprise, the AFLP results do not show any sig- 
nificant grouping according to ecological (edaphic, cli- 
matic, elevational), geographical or morphological factors 
(Additional file 3). The two weakly differentiated groups 
revealed by STRUCTURE and PCO also do not corres- 
pond to any conspicuous phenotypic characteristics. The 



allele-frequency divergence between the two groups found 
by STRUCTURE is low, which explains why we did not 
observe the two groups in the Bayesian and NJ tree- 
building results. Taken together, these results indicate that 
positive selection has perhaps acted on few genomic re- 
gions [32] and has resulted in phenotypic diversification of 
New Caledonian Diospyros, Variation in copy number 
of specific genomic regions may be an additional aspect 
of molecular variation that, although invisible to AFLP 
markers, could form the basis of adaptation to different 
environmental conditions [33]. 

The individuals of D. vieillardii, D. umbrosa and D, fla- 
vocarpa form a minimally isolated group (squares in the 
grey group) in the PCO (Figure 3). Previous phylogenetic 
analyses (Figure 1) showed these three species to be sister 
to the rest of the taxa. Due to its morphological and eco- 
logical features D, sp. Pic N'ga from lie des Pins could be 
a hybrid between D, calciphila and D, vieillardii, but 
D. vieillardii is now not known from this island. In PCO, 
individuals of this putative species are located between 
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Table 2 Results of different AMOVAs conducted 



Percentage of variation 



Analysis 


Among groups 


Among populations within groups 


Within groups 


FST 


p value 


Non-hierarchical 




38.16 


61.84 


0.382 


0.00 


Species-wise 


19.43 


19.15 


61.42 


0.386 


0.00 


STRUCTURE 


9.46 


33.22 


57.32 


0.427 


0.00 


STRUCTURE no admixed 


9.93 


33.39 


56.68 


0.433 


0.00 


Geographic 


1.58 


36.97 


61.45 


0.385 


0.00 


Water 


1.37 


37.20 


61.43 


0.386 


0.00 


Soil 


1.54 


36.92 


61.54 


0.385 


0.00 



In the non-hierarchical analysis, no grouping was applied. In the species-wise analysis, samples were grouped according to taxonomic features (21 groups 
corresponding to the 21 species included). 

In the STRUCTURE analysis, samples were grouped according to the results of STRUCTURE analysis (two groups corresponding to the two groups - white and 
grey - inferred by STRUCTURE); in the analysis without admixed samples seven samples with less than 90% identity to one of the two groups were removed. In 
the geography analysis the samples were grouped according to their origin (three areas - north, middle and south - of New Caledonia). The analysis based on 
water availability was structured into two groups - humid and dry. In the soil-type based analysis, species were grouped according to the substrate on which 
they were found (five groups - limestone, serpentine, schist, ultramafic rock and volcanic rock). 



individuals of D. calciphila and D. vieillardii (Figure 3). 
The split between the two groups observed (Figures 3 
and 4) could be relatively old, separating two lineages 
that developed in isolated regions. For instance, dry pe- 
riods of the Pleistocene caused aridification in many 
areas, and some vegetation types persisted only in local 
refugia e.g. [34-36]. After climatic conditions became 
more favourable, the two groups probably expanded rapidly 
into newly suitable habitats where they overlapped; the time 
scale of these fluctuations (ca. 0.02 - 0.1 myr; [37]) was 
probably not enough to allow woody species with long gen- 
eration time such as Diospyros to diverge and become per- 
manently reproductively isolated [22]. There are a few 
admixed individuals in the STRUCTURE analysis (Figure 4), 
which implies that hybridization might have played a role 
in evolution of this group. 

Accelerated rates of evolution at few genes as a result of 
positive selection could have resulted in the morphological 
and ecological diversification apparent today in this group 
of New Caledonian Diospyros species. Furthermore, in 
addition to retention of ancestral polymorphisms, frequent 
gene flow could have acted against genome-wide genetic 
differentiation between the species. Barriers to gene flow 
between these species may be highly porous, with only few 
genes responsible for ecological and morphological adapta- 
tions evolving on distinct trajectories under strong selec- 
tion, which leaves the rest of their genomes open to gene 
flow [38]. Finding these few genes with AFLP is realistically 
improbable because they are a miniscule component in 
comparison the rest of these genomes. In the case of a re- 
cent and rapid radiation in plants, it could be argued that 
the bulk of regions sampled by AFLP have not evolved 
quickly enough to accumulate substitutions that could indi- 
cate species relationships. Our results are similar to those 
found in various other island genera (e.g. Araucaria in New 
Caledonia, [22]; Ourisia in New Zealand, [39]). 



Diospyros vieillardii, which is sister to the rest of the 
taxa belonging to this group of New Caledonian en- 
demics [12,13], is confined to ultramafic soils, which 
supports the hypothesis of this being an exaptation of 
the progenitor of this New Caledonian Diospyros clade 
to ultramafic sofls when the whole island was still cov- 
ered by heavy-metal-rich substrates; simflar findings 
have been made in other plant groups in New Caledonia 
e.g. [9]. Later, erosion reduced the extent of this geo- 
logical layer to one third of the island [7], and existing 
species began to move onto other substrates where they 
subsequently diverged, forming distinct species. Such 
observations have been made in various other New 
Caledonian groups (e.g. Araucaria, [22]; Spiraeanthe- 
mum, [35]; Codia, [40]). A few studies have examined 
the adaptive basis and processes involved in rapid radia- 
tions in New Caledonia e.g. [41] and Hawai'i (e.g. lobeliads, 
[42]; silverswords, [43]). Linking ecological parameters and/ 
or phenotypic traits associated with speciation has to be 
done with caution because range alterations, subsequent 
evolution, and species extinctions might have erased initial 
signals found in only a few genes. Therefore, the associa- 
tions observed today may be misleading, and the specific 
conditions/traits that were indeed linked to speciation, if 
any, may no longer be present [44]. 

Further work involving common garden experiments 
would provide insights into the effect of environmental 
conditions on morphological traits and therefore plasti- 
city of genomes of the New Caledonian Diospyros spe- 
cies. Unfortunately, such experiments are time and cost 
intensive. It is difficult to obtain ripe fruits of all Dios- 
pyros species, and in addition it is difficult to germinate 
and grow them, which is a crucial aspect of conducting 
such experiments. Reciprocal transplantation of seed- 
lings across environments are of course more easily 
conducted than common garden experiments, but they 
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are still time consuming and costly; in addition species 
adapted to one soil type often will not survive when trans- 
planted to other soil types. 

Conclusions 

Although New Caledonian Diospyros are morphologically 
and ecologically diverse, they show little genetic diver- 
gence (based on DNA sequences and AFLP data). In this 
case of the endemic clade of New Caledonian Diospyros^ 
AFLP data did not provide enough information to resolve 
phylogenetic relationships between the species, but it was 
sensitive enough for testing for the presence of genetic 
species boundaries. However, the AFLP results exhibit a 
good correlation with morphology-based species concepts. 
Further studies of this New Caledonian Diospyros group 
with deeper sampling of the genome using next generation 
sequencing methods are needed to get a clearer picture of 
the processes that formed this group. 

Methods 

Material 

Material from New Caledonian Diospyros species was 
collected on the main island (Grande Terre) and on a 
smaller island, lie des Pins. When possible, we collected 
five individuals per population. Collecting population 
samples from tropical trees/shrubs is not always easy be- 
cause the trees can be tall (and leaves therefore out of 
reach) and individuals are often far from each other. 
Collecting ten individuals in an area of ten square me- 
ters also does not make much sense for a study like this 
because these individuals are probably offspring from 
the same mother plant. As the focus of the present study 
is on the phylogenetic relationships between the species 
and not on population genetics within species, the au- 
thors consider the small size of the samples we collected 
to be sufficient. For widespread species, we collected 
populations throughout their range. For distribution of 
sampling sites, see Figure 5. From samples where fertile 
material was available, a voucher was made with several du- 
plicates sent to the herbaria at Noumea (NOU), University 
of Montpellier II (MPU) and the University of Vienna 
(WU). When sterile, one voucher per population was 
taken; this was compared to already existing collections 
in Noumea Herbarium (NOU) from the same location 
and referred to that species if similar. In total we in- 
cluded in the present study 231 individuals of New 
Caledonian Diospyros species, which correspond to 20 
identified and one unidentified species (due to absence 
of diagnostic reproductive organs at the time of collec- 
tion), giving 47 populations in total. Details of the 192 
individuals (43 populations) for which we were able to 
get useable results are given in Table 3. Silica-gel-dried 
material was used for DNA extraction. 



DNA extraction 

For DNA extraction, a modified sorbitol/high-salt CTAB 
method [46] was used (for details see [13]). 

AFLP 

Preparation and amplification of fragments followed the 
protocol of Vos et al. [14] with some modifications. Re- 
striction of genomic DNA with two restriction enzymes 
and ligation of double-stranded adaptors to the result- 
ing restricted fragments were performed in one step in 
a thermal cycler (Veriti, AB, Life Technologies; 37°C for 
2 h followed by a 30 min hold at 17°C). Reactions com- 
prised 1.1 (iL lOx T4 DNA ligase buffer (Promega), 
1.1 (iL 0.5 M NaCl, 0.55 \xL BSA (1 mg/ mL; New England 
BioLabs), 50 (iM Msel adaptors (genXpress), 5 (iM EcoRI 
adaptors (genXpress), 1 U Msel restriction endonuclease 
(New England BioLabs), 5 U EcoRI restriction endonucle- 
ase (New England BioLabs), 1 U T4 DNA ligase (Pro- 
mega), and 0.5 (ig DNA and were made up to a total 
volume of 11 (iL with water. Ligated DNA fragments were 
diluted 10-fold with sterile water. Preselective amplifica- 
tion reactions contained 1.14 (iL lOx RedTaq PCR reaction 
buffer (Sigma), 0.2 U RedTaq DNA polymerase (Sigma), 
0.22 (iL dNTPs (10 mM; AB, Life Technologies), 0.58 (iL 
preselective primer pairs (EcoRI-A and Msel-CT, each 
5 (iM; Sigma), and 2 (iL diluted restriction-ligation prod- 
uct, and were brought with water to a total volume of 
10 (iL. Amplification was carried out in the same machine 
used for restriction-ligation with the following profile: 
2 min at 72°C, 20 cycles of 10 sec denaturing at 94°C, 
30 sec annealing at 56°C, 2 min extension at 72°C, and a 
final extension step for 30 min at 60°C. The preselective 
PCR products were diluted 10-fold with sterile water. Re- 
actions for selective amplification contained 0.5 (iL lOx 
RedTaq PCR reaction buffer (Sigma), 0.1 U RedTaq DNA 
polymerase (Sigma), 0.11 (iL dNTPs (10 mM; AB, Life 
Technologies), 0.27 (iL Msel-primer (5 (iM; Sigma), 
0.27 (iL EcoRI-primer (1 (iM; Sigma), and 1 (iL diluted 
preselective amplification product and were brought to a 
total volume of 5 (iL with water. They were carried out in 
a GeneAmp PCR System 9700 (AB, Life Technologies) 
with the following profile: 1 min at 94°C, 9 cycles of 1 sec 
at 94°C, 30 sec at 65-57°C (reducing the temperature at 1°C 
per cycle), 2 min at 72°C, 25 cycles of 1 sec at 94°C, 30 sec 
at 56°C, 2 min at 72°C and a final extension for 30 min at 
60°C. The selective PCR products were purified using 
Sephadex G-50 Superfine (GE Healthcare Bio-Sciences) ap- 
plied to a MultiScreen-HV 96-Well Plate (Millipore) in 
three steps of 200 (iL each and settled at 750 g (1, 1 and 
5 min, respectively). The same speed was used for centrifti- 
gation of the samples (5 (iL of each selective PCR product), 
again for 5 min. Two microliters of the eluate were com- 
bined with 10 (iL HiDi and 0.1 (iL GeneScan 500 ROX 
(AB, Life Technologies) and denatured for 3 min at 95°C 
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1 Dumbea 

2 Between Plum and Prony 

3 Col de Prony 

4 Foret du Grand Kaori 

5 Port-Boise 

6 Kuebini 

7 La Roche Percee 
3 Pindai/Nepoui 
9Voh 

10 Between Kone and Poindimie 

11 Conservatoire botanique de Tiea 

12 Aoupinie 
13Netea 

14 Mandjelia 

15 Koumac 

16 Koumac 

17 Paagoumene 

18 Bale de Gadji 
19 Yahoue 

20 Foret de la Superbe (Montagne des Sources) 

21 Moindah 

22 Plateau de Tiea 

23 Plateau de Tango 

24 Nepou 

25 lie Kuebini 

26 lie des Pins 

27 lie des Pins, Pic N'ga 

28 Thio, To De 



50 
I 



100 km 
_l 




Figure 5 Map with sampling localities. Dots indicate sampling sites; tine numbers associated witli eacli dot refer to tlie list of sampling sites on 
this figure. Those numbers are used throughout the present study to characterize sampling sites. 



before running them on a capillary sequencer (3130x1 Gen- 
etic Analyzer, AB, Life Technologies). 

The selective primer pairs (6Fam-EcoRI-AGC/MseI- 
CTGA, Vic-EcoRI-ATG/Msel-CTCG and Ned-EcoRI- 
ATC/Msel-CTGA) were chosen because they generated 
clear and not too many bands (thus decreasing the risk 
of fragments co-migrating by chance), with sufficient 
variability in preliminary tests. Although the genome 
size of the New Caledonian Diospyros species (IC-value: 
1.5 - 2.3 pg; [13]) is smaller than the mean IC-value of 
eudicots (2.7 pg, [47]), we found the AFLP profiles gen- 
erated with Msel primers with four selective bases much 
clearer than those obtained from primers with just three 
selective bases. 

Reproducibility was checked by repeating ca. 80% of 
the samples. This high number of repetitions was neces- 
sary because of initial difficulties with fragment sizing. 

Scoring and phylogenetic analysis 

Sizing and scoring of the data was performed with Gene- 
Marker v2.2.0 (SoftGenetics). After pre-analysis using de- 
fault settings, sizing profiles of all samples were checked 
and where necessary manually corrected. Most of these 
corrections concerned one of the following peaks of the 



size standard: 35 bp, 50 bp and 139 bp. These peaks were 
often not correctly recognized by the GeneMarker pro- 
gram. High-quality sizing-profiles (score > 90) were ob- 
tained for all samples. A panel of scorable fragments was 
established for each primer combination, and fragments 
between 65-510 bp were scored. The relative fluorescent 
unit (RFU) threshold was set at 40. Automatic scoring was 
conducted using Local Southern peak call, peak saturation, 
base line subtraction, spike removal, pull up correction, 
and a stutter peak filter of 5% (as described in [48]). The 
results were exported as presence/absence matrix. The out- 
come of the automatic scoring was manually checked and 
corrected for errors. These errors mostly concerned peaks 
for which shape was atypical. In total 486 samples corre- 
sponding to 231 individuals were scored. From 186 indi- 
viduals replicate samples were performed (between two 
and five replicates per individual). The differences between 
the different samples (replicates) were counted and divided 
by the total number of phenotypic comparisons to get the 
error rate (calculated according to Bonin et al. [49]). After 
initial analysis (neighbour-joining, NJ) of the complete data 
set, replicates of samples and obviously failed samples were 
excluded from further analyses. As replicated samples of 
the corresponding individuals mostly clustered together. 



Table 3 Table of accessions; showing all Individuals used In this study 



Taxon Sample ID Population 

D. colciphilo F.White BT3 1 2-BT3 1 7 1 

D. cherrieri F.White BT262, BT276-BT278 2 

D. cherrieri BT293-BT297 3 

D. erudita F.White BT259-BT261, BT273-BT275 4 



D. erudita 


BT280-BT285, BT287 


5 


D. flovocorpo (Vieill. ex P.Parm.) F.White 


BT126-BT130 


6 


D. flovocorpa 


BT155, BT158-BT159 


7 


D. glans F.White 


BT020-BT022 


8 


D. glans 


BT075 


9 


D. glons 


BT082, BT084, BT087, BT093-BT094 


10 


D. impolita F.White 


BT101-BT105 


11 


D. inexplorata F.White 


BT304, BT307-BT311 


12 


D. labillardierei F.White 


BT121-BT125 


13 


D. lobillordierei 


BT178-BT182 


14 


D. mini mi folia F.White 


BT134-BT135 


15 


D. minimifolia 


BT230-BT234 


16 


D. minimifolio 


BT263-BT264, BT266-267, BT269-BT270 


17 


D. pancheri Kosterm. 


BT029-BT031, BT035 


18 


D. poncheri 


BT076-BT079 


19 


D. parvi flora (Schltr.) Bal<h. 


BT042 


20 


D. porvifloro 


BT059, BT062-BT063, BT068 


21 


D. porvifloro 


BT080, BT085, BT089-BT090 


22 


D. porvifloro 


BT248-BT250, BT252-BT253 


23 


D. porvifloro 


BT289-BT292 


24 


D. perplexo F.White 


BT147-BT151 


25 


D. pustuloto F.White 


BT111-BT114 


26 


D. pustuloto 


BT136-BT140 


27 



Sampling location 



Voucher 



26, littoral forest 

21, dry forest 

24, dry forest 
21, dry forest 



22, dry forest 

10, humid mountain forest 

12, dense humid mountain forest 

1, forest near river 

5, dense forest near road 

6, forest near river 

7, mesophyll forest near beach 
25, littoral forest 

10, river edge in mountain forest 

13, river edge 

1 1, dry forest 

18, mesophyll forest near beach 
21, dry forest 

2, forest near road 

5, dense forest near road 

3, wet forest 

4, wet dense forest 

6, forest near river 

20, humid forest at low elevation 

23, mountain forest 
1 1, forest near river 

8, dry forest 

1 1, dry forest 



JM6650, JM6653 
(MPU, NOU, P) 
NOU079551, WU062860 
NOU054492, NOU054008 
NOU079547 

WU062855, NOU079545 
NOU079544, WU062870 
NOU054010, WU062856 
NOU054011, WU062857 
WU062858, Chambrey &Tumer 20 (NOU) 
JM6625 (NOU) 
JM6632 (NOU) 

NOU053705, NOU030755, WU062846 

NOU000819 

NOU022860 

NOU019538 

NOU005818 

JM6624 (NOU) 

(NOU031346) 

NOU019556 

NOU019554 

NOU079549, WU062872 

NOU054493 

JM6619, JM6620 (NOU) 



NOU006656 
JM6622 (NOU) 
tree no. 23109 
NOU079550 
JM6630 (NOU) 



JM6629 (NOU) 



Table 3 Table of accessions; showing all individuals used in this study (Continued) 



D. pustulata 


BT257-BT258, BT265, BT268, BT271-BT272 


28 


21, dry forest 


NOU079548, WU062871 










NOU053999 


D. revolutissimo F.White 


BT116-BT120 


29 


9, maquis 


NOU023189 


D. revolutissimo 


BT218-BT222 


30 


17, maquis 


JM6640 (NOU) 


D. tridentato F.White 


BT202-BT207 


31 


15, dry forest at low elevation 


JM6639 (NOU) 


D. trisuica F.White 


BT185, BT192, BT197, BT199-BT201 


32 


14, mountain forest 


NOU031344, JM6637 (NOU) 


D. umbrosa F.White 


BT061, BT065-BT066, BT071, BT073 


33 


4, wet dense forest 




D. umbrosa 


BT170-BT171, BT175-BT177 


34 


13, dense humid forest 


JM6635 (NOU) 


D. umbroso 


BT246-BT247, BT251, BT254, BT256 


35 


20, humid forest at low elevation 


NOU023234 


D. veil Ion a F.White 


BT224, BT226-BT229 


36 


18, mesophyll forest near beach 


NOU019582 


D. vieillordii (Hiern) Kosterm. 


BT017, BT023-BT026 


37 


1, forest near river 


JM6618 (NOU) 


D. vieillordii 


BT055, BT057-BT058 


38 


4, dry open forest 




D. vieillordii 


BT088, BT091-BT092, BT098, BTIOO 


39 


6, forest near river 




D. vieillordii 


BT215-BT217 


40 


16, maquis 


NOU023242 


D. vieillordii 


BT324-BT325, BT328 


41 


28, forest near river 




D. yohouensis (Schltr.) Kosterm. 


BT237-BT239 


42 


19, mesophyll forest 


P00057340 


D. sp. Pic N'ga 


BT319, BT321-BT323 


43 


27, maquis 


JM6065 (NOU) 



The numbers of sampling localities are the same as In Figure 2. Voucher-Codes: JMXXXX: collection number J. Munzinger; Tree N° XXX: Tree of New Caledonian Plant Inventory and Permanent Plot Network (NC-PIPPN, 
[45]); NOUXXXXXXX: Herbarium accession number of Noumea herbarium (NOU); WUXXXXXX: Herbarium accession number of the Herbarium of the University Vienna (WU); P: Herbarium of the Natural History Museum 
Paris; MPU: Herbarium of the University of Montpellier II. 
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selection of samples from each individual for further ana- 
lyses was random and not according to any pattern or 
protocol For the final analyses we ended up with 192 
individuals. 

All three primer-combinations were combined in a 
single matrix and analysed together. Different distance 
measures were tested for their power to resolve rela- 
tionships with our data set. Distance matrixes were cal- 
culated in PAUP* v4bl0 ([50]; Nei-Li distance) and 
SplitsTree v4.12.6 ([51]; uncorrected P, Dice, corrected 
and uncorrected Hamming). Phylogenetic relationships 
based on previously mentioned distance matrices were 
reconstructed using SplitsTree v4.12.6 [51] to create 
unrooted NJ dendrograms. To assess robustness of 
branches NJ-bootstrap (NJ-BS) analyses were per- 
formed using SplitsTree v4.12.6 [51] and PAUP* v4bl0 
[50]. Bayesian inference (BI) was conducted with 
BEAST vl.7.5 [52], with two runs each 20 million gen- 
erations, sampling every 1,000^^ generation and re- 
moval of the first 30% of trees as burn in. 

To visualise the pattern of genetic clustering of indi- 
viduals and populations, we plotted principal coordinate 
analysis (PCO) using the R-package scatterplotSd [53] 
based on an individual Dice distance matrix, and re- 
spectively, on AMOVA-derived pair-wise Fst distances 
calculated with Arlequin v3.5.1.2 [54]. To investigate 
further significant groupings of the included individuals 
we used the program STRUCTURE v2.3.3 [55,56] on 
the Bioportal computing cluster of the University Oslo 
[57]. We ran STRUCTURE for K = 1-23 with 10 repli- 
cates each and a model based on admixture and inde- 
pendent allelic frequencies, without taking into account 
information regarding sampling localities. Each run had 
3 million iterations with 10% additional burn in. The 
calculation of deltaK (AK; [27]) and preparation of the 
input file for Clumpp was done with Harvester [58]. 
Production of a combined file from the ten replicates 
of the best K was perfomed using Clumpp vl.1.2 [59] 
with the full search algorithm. The graphical represen- 
tation of STRUCTURE results was prepared with 
Distruct vl.l [60]. 

Both non-hierarchical and hierarchical analyses of mo- 
lecular variance (AMOVA) and calculations of population 
statistics were conducted using Arlequin v3.5.1.2 [54]. For 
hierarchical AMOVAs groups have been defined based on 
different possible clusterings (Additional file 4) according 
to STRUCTURE results, taxonomy, distribution patterns 
and ecological traits. 



Availability of supporting data 

AFLP presence/absence matrix and phylogenetic analyses 
are deposited in treeBASE under study 14798 (http://purl. 
org/phylo/treebase/phylows/study/TB2:S14798). 



Additional files 



Additional file 1: STRUCTURE results of suboptimal K values (3, 6, 16 
and 21) in comparison with K =2. Delta K likelihoods are given for each K. 

Additional file 2: Table showing the population statistics inferred 
from non-hierarchical AMOVA based on STRUCTURE results. 

Populatior^s marked bold differ in this analysis from the general 
population grouping given in Table 3. 

Additional file 3: Figure of the neighbour joining dendrogram 
coloured according to soil type (colour of the branches) and water 
availability (colour of taxa names). This dendrogram is the same as 
Figure 3A, but coloured according to ecological features. 

Additional file 4: Table giving the details of the different AMOVAs 
conducted. The numbers in the populations column are the same as 
given in Table 3, respectively, in Additional file 1 for the STRUCTURE 
based AMOVA. 
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